Quality-biased Ranking of Short Texts in Microblogging Services

نویسندگان

  • Minlie Huang
  • Yi Yang
  • Xiaoyan Zhu
چکیده

The abundance of user-generated content comes at a price: the quality of content may range from very high to very low. We propose a regression approach that incorporates various features to recommend short-text documents from Twitter, with a bias toward quality perspective. The approach is built on top of a linear regression model which includes a regularization factor inspired from the content conformity hypothesis documents similar in content may have similar quality. We test the system on the Edinburgh Twitter corpus. Experimental results show that the regularization factor inspired from the hypothesis can improve the ranking performance and that using unlabeled data can make ranking performance better. Comparative results show that our method outperforms several baseline systems. We also make systematic feature analysis and find that content quality features are dominant in short-text ranking.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Heterogeneous Networks for Information Ranking, Enrichment and Resolution on Microblogs

Microblogging, a new type of online information sharing platform through short messages of up to 140 characters, has grown up quickly and received increasing attentions in recent years. A microblogging platform (e.g., Twitter) enables both individuals and organizations to disseminate information, from current affairs to breaking news in a timely fashion, which makes it a valuable knowledge sour...

متن کامل

Evaluation and ranking of selected hospitals in Mashhad in terms of quality of services provided by the method of FAHP and GRA-TOPSIS

Background: Assessing and improving the quality of services in hospitals because deal with the health of humans is very important. The purpose of this study is to identify and weigh quality criteria and ranking of four hospitals in Mashhad.   Materials & Methods: The present study is of type  Applied Studies  that is a cross-sectional study conducted in the winter of 1396. In this study, by l...

متن کامل

Propagation-based Sentiment Analysis for Microblogging Data

The explosive popularity of microblogging services encourages more and more online users to share their opinions, and sentiment analysis on such opinion-rich resources has been proven to be an effective way to understand public opinions. On the one hand, the brevity and informality of microblogging data plus its wide variety and rapid evolution of language in microblogging pose new challenges t...

متن کامل

Domain Specific Document Retrieval Framework for Real-Time Social Health Data

With the advent of the web search and microblogging, the percentage of Online Health Information Seekers (OHIS) using these online services to share and seek health real-time information has increased exponentially. OHIS use web search engines or microblogging search services to seek out latest, relevant as well as reliable health information. When OHIS turn to microblogging search services to ...

متن کامل

Microblogging In Technology Enhanced Learning: A Use-Case Inspection of PPE

Microblogging is the latest variant of blogging which allows users to post very short messages. Due to its ease interface, the possibility of directly addressing other users, and several surrounding services microblogging becomes more and more used in scientific conferences as main back-channel. This paper discusses microblogging with Twitter as main information back-channel in an exemplary use...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011